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Abstract 

Time  optimal  control  problems  for  a  class  of  linear  multi-input  systems  are 
considered.  The  problems  are  regularized  and  the  asymptotic  and  monotone 
behavior  of  the  regularisation  procedure  is  investigated.  For  the  regularised 
problems  the  applicability  of  semi-smooth  Newton  methods  is  verified.  First 
numerical  tests  are  presented  which  show  that  the  proposed  approach,  differ¬ 
ently  from  other  methods,  does  not  rely  a-priory  information  of  the  switching 
structure. 


1  Introduction 

This  paper  addresses  time  optimal  control  for  a  class  of  linear  multi-input 
controls  systems  for  ordinary  differential  equations.  Due  to  their  practical 
relevance  and  inherent  structural  difficulties,  time  optimal  control  has  been 
receiving  a  considerable  amount  of  attention  for  decades.  Much  of  the  lit¬ 
erature  up  to  the  late  sixties  is  covered  in  [HL],  Many  recent  results  can 
be  found  or  are  referenced  in  [BPW,  KLM,  MO].  Time  optimal  control  for 
infinite  dimensional  systems  is  considered  in  [Fa],  for  example. 

The  optimality  system  associated  to  time  optimal  control  problems  with 
pointwise  constraints  on  the  controls  is  complicated  due  to  lack  of  smoothness 
of  the  optimal  controls.  In  fact,  the  first  order  optimality  system  for  time 
optimal  control  problems  contains  a  multivalued  operation  which  impedes  the 
use  of  fast  numerical  methods.  For  this  reason  we  introduce  a  regularization 
to  the  time  optimal  problem.  In  section  2  the  behavior  of  the  solutions  of 
the  regularized  problems  as  the  regularization  parameter  e  tends  to  zero  is 
investigated.  In  particular  monotonic  structure  of  the  solutions  with  respect 
to  e  is  shown.  An  optimality  system  for  the  regularized  problems  is  derived 
under  a  condition  which  is  stronger  than  controllability  and  weaker  than 
normality.  The  optimal  controls  of  the  regularized  problems  are  W1,oc  regular 
and  converge  to  a  minimum  norm  solution  of  the  original  problem  as  the 
regularization  parameter  tends  to  zero. 

The  optimality  system  of  the  regularized  problems  is  still  not  C 1  so  that 
second  order  methods  with  local  quadratic  convergence  order  are  not  directly 
applicable.  However,  sufficient  conditions  will  be  obtained  in  section  3  which 
imply  that  semi-smooth  Newton  methods  [IK2]  are  wellposed  and  locally 
superlinearly  convergent. 

Section  4  contains  a  brief  description  of  numerical  results.  We  compare 
the  chosen  regularization  to  an  alternative  one,  which  has  stronger  regular¬ 
ization  properties.  Since  the  optimal  controls  of  the  original  time  optimal 
problems  are  typically  not  continuous,  it  appears  that  our  choice  of  regu¬ 
larization  which  leads  to  W1,oa  regularized  controls  is  preferable  over  other 
regularization  strategies  which  provide  smoother  controls.  More  detailed  nu¬ 
merical  tests  are  available  in  [XK], 

Let  us  note  that  the  approach  that  we  propose  for  solving  time  optimal 
problems  deviates  from  traditional  approaches,  which  are  frequently  grouped 
into  direct  and  indirect  methods.  Indirect  methods  based  on  multiple  shoot¬ 
ing  techniques  [Ke]  solve  the  two  point  boundary  value  problem  describing 
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first  order  necessary  conditions.  Equipped  with  a  good  initial  guess  for  all 
unknowns,  including  the  switching  function,  the  shooting  method  is  reported 
to  converge  fast  and  to  generate  very  accurate  solutions.  The  methods  that 
we  propose  also  originates  from  the  first  order  condition,  but  differently  from 
the  shooting  method  it  does  not  require  accurate  information  on  the  switch¬ 
ing  structure  in  advance. 

Direct  methods  on  the  other  hand,  consider  time  optimal  problems  as  a 
genuine  nonlinear  programming  problems.  They  are  used  in  several  variants, 
which  frequently  involve  reparametrization  of  the  controls  as  the  unknowns. 
The  new  unknowns  can  be  the  switching  times  as  in  [MB]  or  the  arc  durations 
as  in  [KN], 


2  The  time-optimal  problem  and  its 
regularization 


Consider  the  time-optimal  control  problem  for  the  linear  multi-input  system 


f  minT>0  f0T  dt 


( P )  { 


subject  to 


■^x(t)  =  Ax(t)  +  Bu(t)i;  \u(t)\{oo  <  1,  ;r(0)  =  xq,  x(t)  =  x\. 


where  A  G  Rnx",  B  G  Rnxm,  x0  G  Rn,  X\  G  M”  are  given,  u(t)  G  Mm,  u  is 
measurable,  and  |  •  |^oo  denotes  the  infinity-norm  on  Rm.  The  columns  of  B 
are  denoted  by  6.,;.  It  is  assumed  that  i  can  be  reached  in  finite  time  by  an 
admissible  control.  Then  ( P )  admits  a  solution  with  optimal  time  denoted 
by  r*,  and  associated  state  x*  and  control  u*. 

The  first  order  optimality  system  for  ( P )  can  be  expressed  in  terms  of 
the  adjoint  p  and  the  Hamiltonian 


H(x,  u,p0,p)  =  po+pT{Ax  +  Bu ), 
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as 


(2.1) 


x  =  Ax  +  Bu ,  rc(O)  =  x0l  x(r)  =  aq, 

—p  =  ATp , 

u  =  a,Tgmm\v\m<1H{x,v,p0,p),  a.e.  in  (0,r), 
Po  +  p{t)T (Ax(t)  +  Bu(t))  =  0,  p0  >  0, 


where  the  superscript  T  denotes  transposition,  see  e.g.  [MS],  chapter  V,  pg. 
109,  110.  Further  p  is  not  identically  0,  so  that  there  exists  a  nontrivial 
vector  q  G  M"  such  that 

p{t)  =  exp  {AT (t  —  t))  q. 


Due  to  the  special  structure  of  H  the  optimal 


control  can  be  expressed 


as 


(2.2)  m  =  —cr(bjp)  =  -a(bjex p  {-AT{r  -  t))  q), 

where  a  denotes  the  coordinate-wise  operation 


{—1  if  s  <  0 

[—1,  1]  if  s  =  0 

1  if  s  >  0. 


The  last  equation  in  (2.1)  holds  everywhere  rather  than  a.e.  on  [0,r],  In 
fact,  p  and  x  are  continuous  and  p(t)T Bu(t)  =  —  \p{t)Tbi\- 

Let  us  recall  the  notions  of  controllability  and  normality,  which  will  be 
referred  to  below. 


(2.4) 


The  pair  {A,  B)  is  called  controllable  if 
rank  {B,  AB, ....  An~1B}  =  n 


{The  pair  (A,  B)  is  called  normal  if  (A,  bA 
is  controllable  for  all  columns  6*  of  B. 

Normality  of  {A,  B)  implies  controllability.  Moreover,  if  ( A,  B )  is  normal, 
then  the  optimal  control  u*  to  ( P )  is  unique,  it  is  bang  bang,  and  piecewise 
constant,  see  e.g.  [MS,  HL], 
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The  requirement 


(2.6) 


Pa  >  0 


is  referred  to  as  strict  transversality.  In  this  case  it  can  be  assumed  that 
Po  =  1,  which  can  be  achieved  by  scaling  q.  If  strict  transversality  holds  then 
(x*,u*,t*)  is  a  strict  local  minimum,  in  the  sense  that  there  exists  S  >  0 
such  that  xi  is  not  in  the  attainable  set  for  t  £  (r*  —  6,  r*),  [HL],  pg.89. 

With  (2.6)  holding,  we  can  express  the  optimality  condition  as 


(2.7) 


x  =  Ax  +  Bu,  ;r(0)  =  xo,  x(r)  =  Xi, 

—p  =  ATp, 

< 

u  =  &Tgmm\v^QH(x,v,p),  a.e.  in  (0,r), 
1  +  p{t)t  (Ax(t)  +  Bu(t ))  =  0. 


Here  we  eliminate  the  variable  po  from  the  notation  for  II  since  it  was  fixed 
to  be  1. 

Introducing  the  transformation  t  =  |  and  setting 


x(t)  =  x(rt)  =  x(t),  p(t)  =  p(rt)  =  p(t ),  u(t)  =  u(rt)  =  u(t% 


we  obtain  the  following  equivalent  system  to  (2.7),  where  for  the  ease  of 
presentation  we  omit  the  superscripts  A: 


(2.8) 


x  =  t(Ax  +  Bu),  ;r(0)  =  xq,  x(l)  =  aq, 

— p  =  rATp, 

< 

ii  argmin  ,  // (:r,  <;..//).  a.e.  in  (0,r), 
l+p{l)T(Ax{l)  +  Bu{l))  =  0. 


The  non-differentiable  operation  involved  in  characterizing  the  optimal 
control, 

u  =  - <j{BTp ), 

compare  (2.2),  prohibits  the  use  of  Newton-type  methods  for  solving  (2.8) 
numerically. 
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Therefore  a  family  of  regularized  problems  given  by 
minT>0  f0T(  1  +  |  \u{t)\2)  clt 
( P£ )  subject  to 

^a(t)  =  Ax(t)  +  Bu(t),  //(/.)  ,<*■  <  1,  :r(0)  =  ay,  %(T)  =  ay, 

with  s  >  0  is  considered.  The  norm  |  •  |  used  in  the  cost-functional  denotes 
the  Euclidean  norm.  It  is  straightforward  to  argue  the  existence  of  a  solution 

(  T-:;  M'ei  ■ 

Convergence  of  the  solutions  (ay,  p£ ,  u£,  rE)  of  (P£)  to  a  solution  (a*,  p*,  u*,  r*) 
of  (P)  is  considered  next.  Note  that  r*  is  unique. 

Proposition  2.1.  For  every  0  <  e0  <  $i  and  any  solution  (r*,  n*)  of  (P) 
we  have 

(2.9)  V<T,a<T„  <T*(l  +  y), 

(2-10)  |  |l2(0,  Tfl )  T  |l2(0,teo)  T  |w  | L2(0,  r* )  • 

//  n*  is  a  bang-bang  solution,  then 

(2.11)  0  <  |«*|i3(0lT.)  -  l^ll2(o,rs-)  <  meas  {*  e  [0,  t*]  :  \ue(t)  |  <  1} 

/or  every  s  >  0. 
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where  we  used  the  fact  that  the  pair  (rSo,u£o)  is  optimal  for  (P£o).  Adding 
|(ei  —  co)  fo"1  |ttfl|2  dt  on  both  sides  implies  that 

ti  fTsi  £  n  fTs°  ru 

(2.12)  Teo  +  y  J  [uEl\2  dt+'-j  (  J  I u£0 12  dt  -  j  I u:ei  I2  dt ) 

<  (1  +  y  \uSl\2)dt  <  r£0  +  \ueo\2dt. 

Estimating  the  first  by  the  last  expression  in  (2.12)  implies  that 

[T?i  rT?  o  r.x  rT?o 

%(  /  |«£l  |2  dt  —  /  |u£o|2  dt)  <£o(  \u£l\2dt-  \u£0\2dt), 

Jo  Jo  Jo  Jo 

and  hence 

(2-13)  |^£i  I  L2(0,  Tsj )  <  Ko|l2(0,teo)- 

Estimating  the  first  by  the  second  expression  in  (2.12)  we  obtain 

,  t0  p  |2  ^  i  I  |2 

+  2  l^o  1^2(0,  t-£0  )  A  t£i  +  9  Im£iIl2(oiTsi) 


and  by  (2.13) 

r*  <  r£0  <  r£l. 

These  estimates  imply  (2.9)  and  (2.10). 

If  u*  is  bang-bang,  then 

0  <  |  U  |L2(0jT»)  —  I  I X/2  (0,Te ) 

<  /pG(0,r*):|«sp)|<i}(1  -  M)\2)dt  <  measjt  e  (0,r*)  :  \ue(t)\  <  1}, 
so  that  (2.11)  holds. 

□ 


Theorem  2.1.  For  e  — >  0+  we  have  t£  — >  t*  and  every  convergent  subse¬ 
quence  of  solutions  {{u£,  .r£)}£>o  to  (P£)  converges  in  L2  (0,  r£;  Mm)  xbb1,2(0,  re;  Kn) 
to  a  solution  ( u *,  x*)  of  ( P ),  where  u*  is  a  minimum  norm  solution. 
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Here  convergence  of  u£  to  u*  is  defined  as 


U£(t£  t )  —  U*(t*  t)\2  (It  — )■  0 
and  analogously  for  {ay},  and  for  weak  convergence. 

Proof.  The  first  claim  follows  from  Proposition  2.1.  Since  {«e(Te.-)}e>o  and 
{a£(rg-)}£>o  are  bounded  in  L2( 0,  1;  R™)  and  !F1,2(0,  1;  R"),  there  exist  weak 
accumulation  points  u*  E  L2(0,r*;Rm),  and  a*  E  1F1,2(0,  r*;  Rra).  Subse¬ 
quently  we  avoid  subsequential  indices.  Passing  to  the  limit  in  x£(r£-)  = 
r,;(.  b\;(r.  •)  +  Bu£(t£  •))  and  ay(0)  =  x0,  x£(r  )  =  ay  it  follows  that  a*  is 
admissible.  Due  to  weak  closedness  of  { u  E  L2( 0,  l;Rm)  :  \u(x)\i°o  <  la.e.} 
we  have  that  u*  is  admissible  as  well.  Since 

lim  t£  T  ~  (  \u£\2  dt  =  t*, 

£->o+  2  J o 

the  triple  (r*,  u*,  x *)  is  optimal  for  (P).  By  Proposition  2.1  and  weak  lower 
semi-continuity  of  norms 

(2.14)  lim  sup  K|l2(o,T£)  <  |«*ji2(0,T*)  <  Jim  inf  \u£\L2(0,T£) 

and  hence  lim  .;>0  |«£|L2(o,rf;Km)  =  |^*|l2(o,t;*^)-  As  a  consequence  u£  and 
x£  converge  strongly  in  L2(0,  re),  respectively  1P1,2(0,  re;  Mn),  to  u*  and  x*. 
Let  u  denote  another  optimal  control  for  (P)  with  u  <  |m*|.  Then  by  (2.10) 
and  (2.14) 

linrsup  \ue\L2(^TE.Rm)  <  |  h |l2(o,t* ;Mm)  <  I'm*  |l2(o,t* ;Mm)  <  lim  inf 

which  is  a  contradiction.  Consequently  (P)  has  a  minimal  norm  control  and 
the  claimed  strong  convergence  properties  hold.  □ 

Corollary  2.1.  If  (2.5)  holds,  then  the  solution  u*  to  (P)  is  unique,  it  is 
bang-bang,  and  u£  — >  u*  in  L 2  as  e  ^  0+. 

Proof.  (2.5)  implies  that  the  solution  to  (P)  is  unique  and  it  is  bang-bang. 
The  remainder  of  the  corollary  follows  from  Theorem  2.1.  □ 
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We  turn  to  the  optimality  condition  for  (P£).  Let 

{—1  if  s  <  —£ 

f  if  |  S I  <  £ 

1  if  s  >  e. 

If  a£  is  applied  to  a  vector,  then  it  acts  coordinate-wise. 

We  shall  use  a  controllability  assumption  which  is  stronger  than  control¬ 
lability  and  weaker  than  normality. 


(HI)  There  exists  i*  such  that  (A,bi*)  is  controllable. 

Theorem  2.2.  Assume  that  (HI)  holds  and  let  (x£ru£fr£)  be  a  solution  of 
{Ps).  If  there  exist  i]  >  0  and  an  interval  p*  C  (0, 1)  such  that 

(2.16)  |  (ue)i*(t) |foo  <1  —  77  for  a-e-  t  G  P*, 


then  there  exists  an  adjoint  state  p£  such  that 

[ =  Axe  +  Bue ,  .!•  (0)  =  ./•(,.  .r  ( r  )  =  Xi 


(2.17) 


I  -p£  =  AT p£ 
u£  =  -<j£{BTp£) 

kl  +  f  |w(^)||m  ••  /Mr,)7  (,l.r.(r,)  +  Bu£{t£ ))  =  0. 


Proof.  We  use  a  Lagrange  multiplier  argument  for  the  reparameterized  for¬ 
mulation  of  (Pe)  which  is  given  by 


(2.18) 


minr>0  l  '  (t  +  y  [nil  ]  |2)  dt 
\  subject  to 

=  r(Ax(t)  +  Bu(t)),  u  G  C,  L(0)  =  xq,  x{1)  =  x i , 


where  C  =  {u  G  L2(0, 1;M™)  :  |u||oo  <  1}.  Here  (u,r)  are  treated  as 
independent  and  x  as  dependent  variable.  Further  u  G  C  is  considered 
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as  explicit  constraint  and  a  Lagrange  multiplier  /i0  is  introduced  for  the 
constraint  e(u,r)  =  x(l)  —  X\  =  0.  The  resulting  Lagrangian  is 

r1  T£ 

£{u,t,h o)  =  J  (t  +  —  \ii{t)\2)  clt  +  [Iq{x{1)  -Xi), 

where  7(1)  is  defined  through  the  differential  equation  and  the  initial  condi¬ 
tion. 

We  now  argue  that  e  :  C  x  R  c  L2(0,  l;Mm)  satisfies  the 

regular  point  condition  in  the  sense  of  Maurer-Zowe  [MZ,  IK2] .  Thus  we 
have  to  verify  that 

(2.19)  0  G  int  {e'{u£,  re)  ({C  -  u£)  x  M)  }, 

where  e'(ue,re)  denotes  the  linearisation  of  e  at  (0,  ( •  r. ).  r.:). 

Considering  e'(u£,T£)  in  directions  St  =  0  and  Su  satisfying  (Su)i  =  0  for 
i  7^  i*  and  (5u),;»  =  0  in  (0, 1)  \  (a,  a  +  5),  with  (a,  a  +  S)  :=  /*.  ,  we  find 

e'(us,  t£)(5u  -  us,  0)  =  J“+5  eT-4d-fi  T  ( Su{t )  -  ue(t))i*  clt 

=  fy  erA(s~t)  bj*  ( Su(t  +  a)  —  u£(t  +  «)),;*  dt, 

where  6,;*  =  reTA^l~5~°^bi* .  Then 

rs 

e'(us,  t£)(8u  -  u£,  0)  =  /  erA{d~t)bi*(Su(t )  -  u£(t))i*  dt, 

Jo 

where  Sui*(t )  =  5uj*(t  +  a),  ue,i*(t)  =  +  a).  Note  that  by  (2.16) 

{(5fi-  u£)i*  :  [0,5]  ->■  M1  I  |(5fi)j.  I  <  1}  D  5  :=  {«  :  [0,5]  ->■  M1  ,  |u[  < 

Observe  that  controllability  of  ( A ,  5**)  implies  that  (M,  5.,*)  is  controllable  as 
well.  Controllability  of  the  single  input  system  ( A ,  b,> )  implies  that 

rs 

(2.20)  0  G  int{  /  eTA^s~t')bi*v  dt  \  v  G  S}. 

Jo 

In  fact  the  set  on  the  right  of  (2.20)  contains  0  and  it  has  nonempty  interior, 
see  e.g.  [LM],  page  77,  133.  Moreover,  if  0  was  a  boundary  point  of  this 
set,  then  the  corresponding  control  v  =  0  is  an  extremal  control,  which  is 
impossible,  e.g.  [LM],  page  133.  Now  (2.20)  implies  (2.19). 
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With  the  regular  point  condition  satisfied,  we  can  conclude  the  station¬ 
ary  properties 


(2.21) 


d-'T  (  .1  1  do )  0  , 

Cu(u£,t£,  fio)(Su  —  ue)  >  0  for  all  5u  with  |5n||oo  <  1. 


From  the  second  property  in  (2.21)  we  have, 

( eu£  +  BTerAT^t^i0)(Su  —  ue)  dt  >  0. 

Setting 

(2.22)  p(t)  =  e~rATiq  with  q  =  ei  A' 

this  implies 

(su£  +  BTp£)(8u  —  u£)  dt  >  0, 

for  all  Su  as  in  (2.21).  The  second  and  third  claim  in  (2.17)  follow  with 
Pe(t)  =Pe{T-lt). 

Exploiting  the  first  property  in  (2.21)  implies  that 


£t(us,  t£,  do)  =  1  +  f  fo  \u£\2  dt 

+/j% (AerAx0  +  /q1  Bue(t)  dt  +  f A  .4(1  —  /)cT 'sl  ,]tBii£(I)  dt) 

=  1  +  I  /q1  \u£\ 2  dt  +  //,( (  A(P'd-  •  T  '.r(l  | 

Jq1  (  ;  ,n  d/j, -)_(/)  (it  _|_  J^1  Tc7  ,;  l  f*  eTAd^sWBu£(s))  ds  dt  =  0. 


This  implies 

(2.23)  Cr{ue,Te,p0 )  =  1  +  77  /  ~dt  I  /  pT{t)  (Ax£{t)  +  Bue(t)]  dt  =  0. 

From  u£  =  —a£(BTp£)  we  conclude  that  u£  e  TT1,oo(0,  r£;  Rm). 

We  introduce  the  Hamiltonian  for  ( /l )  as 

iT(.r,  u,p)  =  1  +  ^Mim  +  pT(Wc  +  Fm). 
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It  is  constant  along  the  optimal  solution.  In  fact  we  have  almost  everywhere 
on  (0, 1) 


-JjHe(xe,  Ue,p£)  =  £UTE4:Ue  +  fA  +  £  AtA  +  pf  Bj-. 


u£ 


=  ( su£  +  BTp£)Tj-u£  =  0 


Combined  with  (2.23)  this  implies  that 


1  + 


Up 


+  pf (Ax g.  +  Bue )  =  0  on  [0, 1]. 


This  implies  the  claim.  □ 

The  proof  revealed  extra  regularity  of  up. 

Corollary  2.2.  Under  the  assumptions  of  Theorem  2.2  we  have  u£  G  ITi,oo(0,  t£;  Rm). 

Remark  2.1.  Condition  (2.16)  requires  that  the  modulus  of  at  least  one  of 
the  coordinates  of  ue  is  not  almost  everywhere  equal  to  1.  Once  it  is  known 
from  Corollary  2.2  that  u£  is  continuous  this  amounts  to  requiring  that  at 
least  one  of  the  coordinates  of  u*  switches  from  1  to  —1  or  vice  versa. 


Under  the  assumptions  of  Theorem  2.2  the  first  order  necessary  optimality 
condition  for  ( P£ )  after  the  transformation  t  — >  ^  is  given  by 


(2.24) 


[ x  =  t(Ax  +  Bu),  a:(0) 


Xq,  ic(l)  :  X] 


—p = rATp 


\u  =  - cr£(BTp ) 


h+  ||'u(l)|2  +p(l)T(Ar(l)  +  Bu(  1))  =  0, 


where  for  convenience  of  notation  the  dependence  on  e  and  the  superscript 
hat  were  dropped. 

In  the  following  section  we  shall  investigate  semi-smootli  Newton  methods 
for  solving  (2.24). 

We  close  this  section  with  a  simple  example  which  illustrates  some  of  the 
features  of  the  regularization  approach. 
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Example  2.1.  Consider  the  two-dimensional  time  optimal  problem  for  the 
simple  control  system 

X\  =  Ml 

x2  =  u2, 

with  so  that  A  is  the  zero,  and  B  the  identity  matrix,  with  initial  condition 
(1,  |)  and  terminal  condition  the  origin.  This  system  is  controllable  but  it  is 
not  normal.  The  optimal  time  is  r*  =  1,  the  first  coordinate  of  an  optimal 
control  is  uniquely  determined  u\  =  —  1  ,  with  associate  state  X\  =  1  —  t. 
There  are  infinitely  many  choices  for  optimal  solutions  u\ j  of  bang-bang  and 
non  bang-bang  type.  The  associated  constant  adjoints  are  {pi,p2)  =  (1,0). 
They  satisfy 

“  =  -a{p)  £  (  [  -~l  !]  )  • 

The  transversality  condition  1  +  pTBu  =  0  is  satisfied. 

For  the  regularized  problem  we  find  t£  =  1.  Differently  from  the  un¬ 
regularized  problem  the  solution  to  the  regularized  problem  is  unique.  The 
optimal  control  and  trajectory  are  given  by 

(mi,m2)  =  (-1,  -.5)  with  (x1,x2)  =  (1  -t,  .5(1  -  £)). 

In  this  particular  example  the  solution  of  the  regularized  problem  does  not 
depend  on  s.  Note  that  this  solution  is  also  one  of  the  minimum  norm 
solutions  of  the  unregularized  problem.  The  adjoint  is  p£  =  (1  +  )■  It 

satisfies 

u  = -a e{pTe)=  ^ 

and  the  transversality  condition  1  +  |[m|2  +  pj Bu  =  0. 


3  Semi-smooth  Newton  method 


In  this  section  the  semi-smooth  Newton  method  for  solving  the  regularized 
optimality  system  (2.24)  is  described  and  analyzed.  It  will  allow  that  (2.24) 
can  be  solved  efficiently  inspite  of  the  fact  that  a£  is  not  differentiable. 
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Throughout  we  assume  (HI)  to  hold.  We  fix  e  >  0  and  denote  by 
( x£ ,  us,  re)  G  fT1,2(0, 1)  x  L2( 0, 1)  x  R  a  solution  to  (P£)  with  associated 
adjoint  pe  G  W1,2(0,  1).  It  is  assumed  that 

(H2)  there  exists  s  G  (0,1)  such  that  \-bJtpe(s)\  =  |(«£)i*(s)|  <  1, 
and 

(H3)  | bj p£(l)|  ^  £,  for  all  i  =  1, . . . ,  m. 

Assumption  (H2)  corresponds  to  (2.16),  where  we  now  use  the  fact  that  as  a 
consequence  of  Theorem  2.2  the  control  u£  is  continuous.  With  (H2)  and  (H3) 
holding  there  exists  a  neighborhood  UPe  of  p£  in  IT1,2(0, 1;  R"),  t  G  (0, 1),  and 
a  nontrivial  interval  (a,  a  +  8)  C  (0, 1)  such  that  for  p  G  UPe  we  have 

\bj  p(t)  |  ^  £  for  all  t  G  [t,  1],  and  i  =  1, . . . ,  m 

and 

(3.1)  \bj,p(t)\  <  e  for  t  G  (a,  a  +  5). 

We  set  U  =  {u  G  L2( 0,  1;RTO)  :  u\[t,  1]  G  W1,2(t,  l;Rm)}  endowed  with  the 
norm 

M  u  =  (lMli,2(0;1)  +  |m|  L2(f;1))2, 

and  introduce 

F  :  DF  C  X  — >  L2{ 0, 1;  R”)  x  L2(0, 1;  R”)  x  U  x  R"  x  R 

where 

=  W1,2(0, 1)  x  UPe  xU  x  R, 


and 


(3.2) 


X  =  IT1’2(0, 1 ;  R" )  x  ^^(0, 1;  R" )  x  U  x  R, 


F(a:,  p,  u,  t ) 


(  x  —  tAx  —  tBu  \ 

— p  —  r  ATp 
u  +  c TE{BTp ) 

,t(1)  —  xi 

\  1  +  ||u(l)|2  +p(l)T  (Aa’(l)  +  5«(1))  / 
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Note  that  F  =  (Fx, . . . ,  F5)  is  well-defined.  This  is  obvious  for  Fx,  F2  and 
F3.  For  FirF5  it  follows  from  the  fact  that  T/F1,2(0, 1)  embeds  continuously 
into  (7(0, 1).  Morevover  F(xs,  pe ,  u£,  re)  =  0.  We  shall  keep  x£(0)  =  x0  as 
an  explicit  constraint. 


Remark  3.1.  The  need  for  introducing  U  in  such  a  way  that  its  elements 
are  more  regular  at  1  is  due  to  the  fact  that  we  use  here  the  point-wise 
transversality  condition  rather  than  the  integrated  form  (2.23).  -  Condition 
(H3)  will  be  needed  to  prove  superlinear  convergence  of  the  Newton  iteration. 

Applying  Newton’s  method  to  F  =  0  is  impeded  by  the  non-differentiability 
of  a£.  We  use 


(3.3) 


GVe(s) 


if  |  -S'  |  <  £ 

if  Is  I  >  £ 


as  a  generalized  derivative  and  argue  that  the  resulting  Newton  iteration  is 
semi-smooth  and  hence  locally  superlinearly  convergent.  The  Newton  itera¬ 
tion  step  is  given  by 


(3.4)  DF(x,  p,  u,  r)(Sx ,  Sp,  Su,  Sr)  =  —F(x,  p ,  u,  r) 


where  Ar(0)  =  0  and  DF  denotes  the  Frechet-derivative  in  all  terms  of  F  ex¬ 
cept  for  p  — >■  ae(BTp ),  for  which  the  generalized  derivative  is  taken  according 
to  (3.3).  For  further  reference  we  give  the  detailed  form  of  (3.4): 


(3.5) 


"  jjjSx  —  tA5x  —  tB  8u  —  5t(Ax  +  Bu )  =  —Fx,  Ac(0)  =  0 
—  ^ Sp  —  rAT8p  —  5tAtp  =  —F2 
8u  +  Gae{BTp)BT5p  =  -F3 
Sx(  1)  =  — F4 

p(l)T{A  8x(l)  +  B8u(  1))  +  8p(l)T (Ax(l)  +  Bu(  1)) 

+£u{1)t8u{1)  =  —F5, 


where  the  coordinates  of  Ga£(BTp)BT8p  are  given  by  Ga£((BTp).{))(BT 8p)i, 
A  possible  initialization  may  consist  in  choosing  ((w)0,  r0),  setting  (x)0  as 
the  linear  interpolation  between  x0  and  xx,  and  determining  (p)0  such  that 
the  transversality  condition  and  the  adjoint  equation  are  satisfied. 
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We  now  briefly  summarize  those  facts  from  semi-smooth  Newton  methods 
which  are  relevant  for  this  paper.  Let  X  and  Z  be  Banach  spaces  and  let 
F  :  Dp-  C  X  — y  Z  be  a  nonlinear  mapping  with  open  domain  Dp. 


Definition  3.1.  The  mapping  F  :  Dp  C  X  Z  is  called  Newton- differentiable 
on  an  open  subset  U.  C  Dp,  if  for  each  x  G  IA  there  exists  a  generalized 
derivative  DFfx)  G  C(X,  Z)  and 


(3.6) 


lim  777 

h—± o  \h\ 


X 


| F(x  +  h)  -  F(x)  -  DF(x  +  h)h\z  =  0. 


Theorem  3.1.  Suppose  that  x*  G  IA  is  a  solution  to  Ffx)  =  0  and  that  F  is 
Newton- differentiable  in  an  open  set  U  containing x*.  If  further  {\\DFfx)l\\  : 
x  G  IA]  is  bounded,  then  the  Newton-iteration  x*+1  =  xfe  —  Z}F(xfe)_1  Ffxk) 
converges  q-superlinearly  to  x*,  provided  that  |x0  —  x*|  Y  is  sufficiently  small. 


For  the  statement  and  proof  of  superlinear  convergence  of  the  time-optimal 
control  problem,  some  further  notation  is  required.  For  ( x ,  p.  u,  r)  G  Dp  we 
define  A  G  ]^(n+1)x(n+i)  by 


where 

(3.7) 


/  An 
\  -421 


An 


c  !  !  '  B\jBr  <:Va  1  dt  G  Knx" 


(3.8) 


A12  =  s  1t  fie**1  1  II  \  i  II1  Jf  e  tATA  ^  At p(s)  ds  dt 
- 11  (Ax  +  Bu)  dt  g  K" , 


(.3.9)  t421  =  (Ax(l)+Bu(l))T-(pT(l)B+suT(l))Gae(BTp(l))BT  G  (Mn)T, 
with  xi  =  diag(xi1,  •  •  • ,  Xim)  an<4  Xh  the  characteristic  function  of  the  set 

h  =  h(p)  =  {t  :  \(BTp)i\  <  -},  i  1 .... .  m 

£ 
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which  is  nonempty  for  p  G  UPe  and  i  =  i*.  The  controllability  assumption 
(HI)  together  with  (H2)  imply  that  the  symmetric  matrix  An  is  invertible 
with  uniformly  bounded  inverse  with  respect  to  p  G  UPe  and  r  in  compact 
subsets  of  (0,  oo).  In  fact,  since  It*  (p)  D  (a,  a  +  S)  we  obtain  for  some  c  >  0 

pi  m 

An  =  e  'r  /  ^  b;\l:bjf!VA  1  dt 

Jo  ~7 


>  £  1  T 


»a+5 


jA(l—t)  h,tbT  l)  dt 


=  £-lTeTA(l-a)  /  e-rM  bi*bj.  errATtdt  erAT(1-“)  dt  >  c, 


where  wTe  use  that  the  controllability  Gramian 

f  e  7  ubj-  hj.  e~rATt  dt, 


is  uniformly  positive  definite  for  r  in  compact  subsets  of  (0,  oo). 

For  our  analysis  we  shall  utilize  the  fact  that  the  Schur  complement 

A21  A -q1  A\ 2  G  K. 


of  A  for  (x,  p,  u,  r)  in  a  neighborhood  of  (x£,  p£,  uf .  t£)  is  nontrivial.  If 
A2 1  4 ,  /  A12  A  0  at  ( x£ ,  p£,  u£,  t£ ),  we  cannot  conclude  that  A2 1  A , A12  A  0 
for  ( x ,  p,  u,  r)  in  a  neighborhood  of  ( xe ,  p£,  u£,  t£ ),  since,  while  with  (H2) 
holding,  A2\  is  continuous  with  respect  to  ( x ,  p,  u,  r)  G  X,  this  is  not  the 
case  for  Af A  and  A 12  due  to  the  term  \i-  We  therefore  assume  that 


(H4) 


there  exists  a  bounded  neighborhood 
<  U  C  D ]a  C  ^  of  ^  Psi  ^£5  j  Siiid  c  0  such  thuf 
,  1^21  A n1  A12 1  >  c  for  all  (x,  p,  u,  t)  G  U. 


Theorem  3.2.  If  (H1)-(H4)  hold  and  (x£,  u£,  r£)  denotes  a  solution  to 
(Ps)  with  associated  adjoint  p£,  then  the  semi-smooth  Newton  algorithm  con¬ 
verges  superlinearly,  provided  that  the  initialization  is  sufficiently  close  to 

{$£  •  P-  •  T-  •  he)- 
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For  the  proof  we  require  the  following  lemma. 

Lemma  3.1.  Suppose  that  (HI)  -  (H4)  hold.  Then  there  exists  a  constant 
C  such  that  for  every  (x,  p,  u,  r)  E  U,  and  F  E  L2(  0,  1)  x  f2(0,  1)  x  U  xl 

DF(x ,  p,  u,  t)(5x ,  Sp,Su ,  St)  =  —F 

admits  a  unique  solution  ( 5x ,  Sp,  Su,  St)  E  X  and 

(3.10)  |(<fa,  Sp,Su,  St) \x  <  C  \F\L2xL2xUxR. 


Proof.  Let  (x,  p,  u,t)  eU  and  note  that  system  (3.5)  is  equivalent  to 


(3.11) 

Sx(t)  =  Jq  eTA^t~s')(—Fi  +  rbSu  +  8t(Ax  +  Bu))  d.s 
5p(t)  =  erAT Sp(  1)  +  ff1  e~T AT ^~s\St AT p  —  F2)  ds 
<  Su  +  Ga£(BTp)BT  Sp  =  -F3 
8x(l)  =  — F4 

j){1)t{-AF4  +  B  Su{  1))  +  Sp(l)T(Ax(l)  +  Bu{  1))  +  &5u(  1)  =  -F5. 

On  Ij  we  have  ( Ga£(BTp))i  =  e^1.  The  third  equation  in  (3.11)  can  be 
expressed  as 

(3.12)  Su  =  —F3  —  £~lXi  BT Sp  a.e.  in  (0,1). 

Let  us  set 

Fi  =  -  f  eTA(1-t] F^t)  dt,  F2  =  £-1t  I  I  eTA(1-L)BxiBT  e-rAT^-s)F2{s)dsdt, 

Jo  Jo  Jt 

F3  =  -tB  f  eTA[l-t] F3{t)  dt. 


From  (3.12),  and  the  first  and  fourth  equation  in  (3.11)  we  have 
(3.13) 

— F4  =  &c(l) 

=  —  s~1t  Jq1  erA^~^  BxiBT  Sp  dt  +  St  erA<'1~t"1  (Ax  +  Bu)  dt  +  Fx  +  F3 
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Replacing  Sp  by  the  second  equation  in  (3.11)  we  find 

-  F4  =  -s~xt  I  eTA{1-t]BxiBT  eTAT^~t)8p{  1)  clt 

Jo 

s  'tSt  I  eTA(1-t)BXiBT  I  e-TATV-s]ATp{s)dsdt  + F2 

Jo  Jt 

+  St  f  e'rA<~1~t\Ax  +  Bu)  dt  +  F\  +  F3, 

Jo 

which  involves  5p(  1)  and  St  as  unknowns,  and  can  be  expressed  as 

(3.14)  AuSp(  1)  +  A12St  =  F1  +  F2  +  F3  +  F4  =:  r4. 

Eliminating  8u{l)  from  the  last  equation  in  (3.11)  by  means  of  the  third 
equation  implies 

(3.15)  A21Sp(l)  =  -F5  +  F3{l)T{BTp{l)  +  eu{  1))  +p(l)TAF4  =:  r2. 
Combining  (3.14)  and  (3.15)  we  obtain  the  following  linear  system  for  (<5p(l),  St)  : 


(3.16) 


A 


Sp{  1) 

St 


r  i 
r2 


By  (HI),  (H2),  and  (H4)  its  unique  solution  is  given  by 

(3.17)  St  =  {A2 1  A^i  Ai2)~l{A21  rx  -  r2),  Sp{  1)  =  A^(ri  -  A12St). 

Moreover  there  exists  a  constant  C  =  C(r,  |s|c(o,i),  |p|e(o,i)5  Im|l2(o.i)5  |u(l)|), 
such  that 

Ml)  |  +  |4"r|  <  C  \F\L2xL2xUxr- 
From  (3.5)  and  (3.11),  C  can  also  be  chosen  such  that 


(4.r.  Sp.  Su,  St)  ,y  <  C  \F\L2xL2xUxR. 


□ 
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Proof  of  Theorem  3.2.  We  apply  Theorem  3.1  with  x*  =  {x£.  p£,  ue,  r£). 
Lemma  3.1  implies  the  required  uniform  bound  of  the  generalized  inverses 
DF  in  the  neighborhood  U  C  DF  of  (xe,  p£,  u£,  t£).  Therefore  it  suffices  to 
argue  Newton-differentiability  of  F  in  DF.  This  is  obvious  for  all  coordinates 
of  F  except  for  F?l,  and  specifically  for  the  mapping 

F  :  P  cr£{BTp ) 

from  UPe  C  IU1,2(0,  1;  Kn)  — >  U.  Utilizing  the  definitions  of  UPe  and  o£  it 
suffices  to  consider  the  restriction  of  F  from  1U1,2(0,  f;  Mn)  to  L2( 0,  t\  M1) 
which  we  again  denote  by  F .  Note  that  T  can  be  decomposed  as 

F  =  F.\  o  F 2  °  F\ . 


where 

F\  :  1U1i2(0,  f;  K")  W1’2^,  f;  M),  F2  :  ^^(O,  f;  M)  ->•  L4(0,  f;  M), 

:  T4(0,  t;  M)  — >■  T2(0,  t;  M), 


are  given  by 

^i(«)  =  JU2(u)  =  max(  —  1,  -),  JU3(y)  =  min(l,u). 

t 

In  [HIK,  IK2]  it  was  shown  that  v  — )■  max(0,u)  is  Newton  differentiable 
from  Lp(n)  to  Lq(Q)  if  oc  >  p  >  q  >  1,  if  Q  is  a  bounded  domain.  Since 
m inf  1.  v)  =  1  +  min(0,  v  —  1)  this  implies  that  F3  and  similarly  that  F2  are 
Newton  differentiable.  From  the  chain  rule  for  Newton  differentiable  mapping 
in  [HK]  it  follows  that  F3  o  F2  is  Newton  differentiable.  The  chain  rule  for 
a  linear  mapping,  here  F\ .  followed  by  the  Newton  differentiable  mappings 
F30  F2,  [IK1],  implies  that  F  is  Newton  differentiable  in  DF .  □ 


4  A  numerical  example 

The  semi-smooth  Newton  method  is  used  to  solve  a  classical  time  optimal 
problem  related  to  the  harmonic  oscillator  with  three  switching  points.  We 
consider 
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(4.1) 


min . .  f0T  dt 

<  subject  to 

-^x(t)  =  Ax(t)  +  Bu(t ),  |ti(t)|  <  1,  x(0)  =  xq,  x(t)  =  xi, 


where 


A 


x  i  = 


The  optimal  minimal  time  for  the  continuous  problem  is  known  to  be  r*  = 
10.5871.  To  solve  (4.1)  numerically  a  time  discretization  based  on  the  Crank 
Nicolson  method  with  equidistant  grid  points  was  applied  to  (3.5).  The 
initialisation  for  the  state  was  chosen  as  a  semicircle  connecting  xq  and  x\ . 
Then  u(l)  was  chosen  to  be  active,  and  p  was  chosen  so  that  the  transversality 
condition  and  the  adjoint  equation  hold.  With  respect  to  the  choice  of  the 
parameter  c  -  we  utilized  a  continuation  procedure,  starting  with  a  small 
value  and  increasing  it,  using  the  solution  from  the  smaller  value  of  c  as 
initialization  for  the  next  larger  c- value.  Certainly  this  procedure  can  be 
automated  as  has  been  done  elsewhere,  but  this  was  not  the  focus  of  this 
paper.  In  Table  1  we  show  the  number  of  iterates  of  the  Netwon  iteration 
(outer  loop)  that  was  required  for  this  continuation  procedure  with  respect 
to  c.  The  Newton  iteration  was  stopped  when  the  residual  of  the  optimality 
system  in  the  L2-norm  was  below  ICC8.  Also  in  Table  1  we  depict  the  optimal 
minimal  times  r*(c).  These  results  are  obtained  for  meshsize  h  =  A  Then 


c 

1 

5 

10 

20 

No.  of  iterations 

8 

8 

4 

7 

Final  Time 

11.26515 

10.84455 

10.82977 

10.81781 

Table  1 


the  results  for  c  =  1  are  interpolated  to  the  finer  grid  h  =  -W  and  the 
continuation  procedure  with  respect  to  c  is  repeated.  The  results  are  depicted 
in  Table  2.  The  graphs  for  the  corresponding  controls  are  given  in  Figure  1. 

The  same  procedure  with  h  =  1/512  and  c  =  100  gives  the  optimal  time 
10.588.  In  some  cases,  typically  at  the  beginning  of  the  iterations  and  for 
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c 

1 

10 

50 

100 

200 

No.  of  iterations 

5 

46 

4 

4 

3 

Final  Time 

11.1088 

10.6092 

10.6034 

10.6033 

10.6031 

Table  2 


Figure  1:  N  =  128  and  c  =  l(left),  lO(middle)  and  lOO(right) 

the  lowest  values  of  c  the  full  Newton  step  was  too  large.  Therefore  we  used 
a  one-dimensional  line  search  based  on  a  quadratic  polynomial  interpolation 
for  the  L2—  norm  of  the  residual  combined  with  an  Armijo  rule. 

Table  3  depicts  the  quotients  ,  where  u*(c )  is  the  solution  to 

the  discretized  version  of  (2.17)  for  c  =  50.  It  shows  that  the  algorithm  is  in 
fact  superlinearly  convergent. 


No.  of  iterations 

1 

2 

3 

4 

Ck 

0.94138 

0.00037 

0.00001 

0.00000 

Table  3 


In  this  paper  we  chose  to  regularize  a  by  the  ramp  functions  aE  with 
increasing  slops  as  £  — »•  0+.  Certainly  other  alternatives  are  possible  as  for 
instance  ac(s )  =  |  atan(cs).  This  family  of  C°°—  functions  also  has  the 
property  that  it  converges  to  a  as  c  — >■  0,  but  it  appears  to  be  less  apt  for 
the  purpose  of  approximating  the  discontinuous  switching  structure  of  the 
optimal  controls  since  c  has  to  be  taken  significantly  larger  for  ac  than  for 
on  to  obtain  comparable  results. 

Acknowledgement:  We  thank  Mrs.  .J.  Rubesa  for  providing  us  with  the 
numerical  example. 
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